Complete Genome Sequence of Middle East Respiratory Syndrome 
Coronavirus KOR/KNIH/002_05_2015, Isolated in South Korea 

You-Jin Kim,a Yong-Joon Cho,b Dae-Won Kim,c Jeong-Sun Yang,a Hak Kim,a SungHan Park,a Young Woo Han,a Mi-ran Yun,c 
Han Saem Lee,a A-Reum Kim,a Deok Rim Heo,a Joo Ae Kim,a Su Jin Kim,a Hee-Dong Jung,a Namil Kim,b Seok-Hwan Yoon,b 
Jeong-Gu Nam,a Hae Ji Kang,a Hyang-Min Cheong,a Joo-Shil Lee,d Jongsik Chun,b Sung Soon Kim a 

Division of Respiratory Viruses, Center for Infectious Diseases, Korea National Institute of Health, Korea Centers for Disease Control and Prevention, Cheongju-si, South 
Korea a; ChunLab, Inc., Seoul National University, Seoul, South Korea b; Division of Biosafety Evaluation and Control, Korea National Institute of Health, Korea Centers for 
Disease Control and Prevention, Cheongju-si, South Korea c; Korea National Institute of Health, Korea Centers for Disease Control and Prevention, Cheongju-si, South 
Korea d 

The full genome sequence of a Middle East respiratory syndrome coronavirus (MERS-CoV) was determined from virus isolated and 
cultured in Vero cells. The viral genome sequence has high similarity to 53 human MERS-CoVs, ranging from 99.5% to 99.8% at 
the nucleotide level. 
Middle East respiratory syndrome coronavirus (MERS-CoV) 
is the first betacoronavirus lineage C member isolated from 
humans. It has been assumed that MERS-CoV was transmitted 
from bats and spread to humans through intermediate hosts (1). 
The genome is a single-stranded RNA (ssRNA) encoding 
10 proteins: two replicase polyproteins (open reading frames 
[ORFs] 1ab and 1a), three structural proteins (E, N, and M), a 
surface (spike) glycoprotein (S), and four nonstructural proteins 
(ORFs 3, 4a, 4b, and 5) (2). 

A sputum sample was collected from a second patient on 
20 May 2015. The MERS-CoV was inoculated into Vero cells and 
passaged three times. The RNA was isolated from the third viral 
culture solution with the QIAamp viral RNA mini kit (QIAGEN, 
Germany). Reverse transcription was performed with the 
SuperScript III first-strand synthesis system (Life Technologies, the 
Netherlands) with specific reverse primers. The cDNA was 
amplified by PCR with overlapping primers based on a previous study (3). 
Additional PCR primers were designed for nonamplified regions. 
The resulting PCR amplicons were pooled and fragmented to an 
average length of 300 bp, and the sequencing library was 
constructed with an Illumina TruSeq Nano DNA sample prep kit 
(Illumina, USA). The sequencing was performed with an Illumina 
MiSeq 50-bp single-end platform (Illumina). A total of 2,814,805 
sequence reads were generated, and 2,617,936 reads (93.01%) 
were mapped to the consensus sequence from human-origin 
MERS-CoV genome sequences retrieved from GenBank. 
Mapping was performed with Bowtie version 2.2.4 (4) using default 
parameters. Finally, the whole viral genome sequence was 
obtained from the mapped result with an average coverage of 
3,605.95×. Based on the assembly, the genome size was estimated 
to be 30,108 bp with a GC content of 41.15%. 
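
The mapping rate and GC content quoted above are simple ratios. The following minimal Python sketch shows how such figures could be recomputed; the function names mapped_percent and gc_percent are hypothetical helpers introduced here for illustration, the toy sequence is not taken from the genome, and the choice to ignore ambiguous bases in the GC calculation is an assumption, since the text does not state how they were handled.

```python
def mapped_percent(mapped_reads: int, total_reads: int) -> float:
    """Percentage of sequence reads that mapped to the reference."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return 100.0 * mapped_reads / total_reads


def gc_percent(sequence: str) -> float:
    """GC content as a percentage of unambiguous bases (A, C, G, T, U).

    Assumption: ambiguous symbols such as N are excluded from the denominator.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T") + seq.count("U")
    if gc + at == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * gc / (gc + at)


# Read counts reported in the text.
print(f"{mapped_percent(2_617_936, 2_814_805):.2f}%")  # prints 93.01%

# Toy sequence (hypothetical, for illustration only).
print(f"{gc_percent('ATGCGCTA'):.2f}%")  # prints 50.00%
```

Applied to the deposited sequence (accession no. KT029139), gc_percent would be expected to give a value close to the reported 41.15%, depending on how ambiguous bases are treated.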

The sequence analysis of South Korean MERS-CoV was 
performed with 53 complete genomes of human MERS-CoV 
available from GenBank using MUSCLE in the MEGA version 6 
package (5). The full-genome sequence of 
MERS-CoV/KOR/KNIH/002_05_2015 showed overall nucleotide identities of 99.5% to 
99.8% with 53 human MERS-CoVs. The overall identity to 
EMC/2012 (accession no. JX869059), the reference genome, was 99.5%. 
The closest strain was Hafr-Al-Batin_1 (accession no. KF600628) 
with 99.8% similarity. 
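
As an illustration of what a pairwise nucleotide identity measures, the sketch below computes the percentage of matching positions between two rows of an alignment. It is not the MEGA implementation: the function name percent_identity is hypothetical, the toy sequences are invented, and skipping columns that contain a gap is an assumption, because the text does not say how gaps were treated.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two rows of the same alignment.

    Assumption: columns where either sequence has a gap ('-') are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # gapped column: not counted
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned sequences (hypothetical): 9 ungapped columns, 8 matches.
row_a = "ATGGC-TTAC"
row_b = "ATGACGTTAC"
print(f"{percent_identity(row_a, row_b):.1f}%")  # prints 88.9%
```

Other gap conventions, such as counting a gap as a mismatch, would give slightly different percentages for the same alignment.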

In this analysis, the Korean MERS-CoV genome contained 29 
nucleotide and 12 amino acid variants compared to the 53 full-genome 
sequences of human MERS-CoV. Two specific variations in the spike 
protein, which mediates virus entry and affects the viral host range, 
were identified only in the cell-cultured 
MERS-CoV/KOR/KNIH/002_05_2015: Arg137Ser in the N-terminal domain and 
Leu530Val in the receptor-binding domain (compare with 
other variation studies of the receptor-binding domain of the 
spike protein [6]). 
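
Substitutions written as Arg137Ser are conventionally read as reference residue, position, variant residue. The sketch below shows one way such labels could be derived from two aligned protein sequences; it is an illustration under stated assumptions, not the procedure used in this study. The function name substitutions and the toy sequences are hypothetical, and positions are numbered along the ungapped reference.

```python
from typing import List

# Standard one-letter to three-letter amino acid codes.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def substitutions(aligned_ref: str, aligned_query: str) -> List[str]:
    """List amino acid substitutions as <Ref><position><Query>.

    Assumption: positions count ungapped reference residues from 1;
    insertions and deletions are skipped rather than reported.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    found = []
    position = 0
    for ref, qry in zip(aligned_ref.upper(), aligned_query.upper()):
        if ref == "-":
            continue  # insertion relative to the reference: no position
        position += 1
        if qry == "-" or ref == qry:
            continue  # deletion or identical residue
        found.append(
            f"{THREE_LETTER.get(ref, ref)}{position}{THREE_LETTER.get(qry, qry)}"
        )
    return found


# Toy aligned peptides (hypothetical, not spike protein sequence).
print(substitutions("MKRLS", "MKSLV"))  # prints ['Arg3Ser', 'Ser5Val']
```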

Nucleotide sequence accession number. The complete 
genome sequence of the MERS-CoV/KOR/KNIH/002_05_2015 
isolate was deposited in GenBank under the accession number 
KT029139. 